Multi-task Sequence to Sequence Learning

نویسندگان

  • Minh-Thang Luong
  • Quoc V. Le
  • Ilya Sutskever
  • Oriol Vinyals
  • Lukasz Kaiser
چکیده

Sequence to sequence learning has recently emerged as a new paradigm in supervised learning. To date, most of its applications focused on only one task and not much work explored this framework for multiple tasks. This paper examines three settings to multi-task sequence to sequence learning: (a) the one-to-many setting – where the encoder is shared between several tasks such as machine translation and syntactic parsing, (b) the many-to-one setting – useful when only the decoder can be shared, as in the case of translation and image caption generation, and (c) the many-to-many setting – where multiple encoders and decoders are shared, which is the case with unsupervised objectives and translation. Our results show that training on a small amount of parsing and image caption data can improve translation quality by up to 1.5 BLEU points. Additionaly, we reveal interesting properties of the two unsupervised learning objectives, autoencoder and skip-thought, in the context of multi-task sequence to sequence learning.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Seismic Data Forecasting: A Sequence Prediction or a Sequence Recognition Task

In this paper, we have tried to predict earthquake events in a cluster of seismic data on pacific ring of fire, using multivariate adaptive regression splines (MARS). The model is employed as either a predictor for a sequence prediction task, or a binary classifier for a sequence recognition problem, which could alternatively help to predict an event. Here, we explain that sequence prediction/r...

متن کامل

Lot Streaming in No-wait Multi Product Flowshop Considering Sequence Dependent Setup Times and Position Based Learning Factors

This paper considers a no-wait multi product flowshop scheduling problem with sequence dependent setup times. Lot streaming divide the lots of products into portions called sublots in order to reduce the lead times and work-in-process, and increase the machine utilization rates. The objective is to minimize the makespan. To clarify the system, mathematical model of the problem is presented. Sin...

متن کامل

Optimum Aggregate Inventory for Scheduling Multi-product Single Machine System with Zero Setup Time

In this paper we adopt the common cycle approach to economic lot scheduling problem and minimize the maximum aggregate inventory. We allow the occurrence of the idle times between any two consecutive products and consider limited capital for investment in inventory. We assume the setup times are negligible. To achieve the optimal investment in inventory we first find the idle times which minimi...

متن کامل

The role of sleep on children's motor memory consolidation in a motor sequence task

Sleep has been shown to affect the consolidation of motor memory in adults. However, sleep-dependent consolidation is not well-specified in children as like as adults. So, the purpose of this study is to investigate the effect of sleep on childrenchr('39')s motor memory consolidation. The participants were 32 voluntary boys aging from 10 to 12 years old from city of Kerman, which had the necess...

متن کامل

Study of Explicit Knowledge Effects on Implicit Motor Learning in Older Adults

Objective: The motor skill performance and learning in older adults are of a great importance. The primary purpose of this study was to determine whether older adult’s explicit knowledge effect on implicit learning and motor performance and which are typical for rehabilitation and skills acquired in older adults. Materials & Methods: In this comparative study a serial reaction time task by u...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1511.06114  شماره 

صفحات  -

تاریخ انتشار 2015